UNCLASSIFIED 


tn  264  783 


luf.  ike 

ARMED  SERVICES  TECHNICAL  INFORMATION  AGENCY 
ARLINGTON  HALL  STATION 
ARLINGTON  12,  VIRGINIA 


UNCLASSIFIED 


NOTICE:  When  government  or  other  drawings,  speci¬ 
fications  or  other  data  are  used  for  any  purpose 
other  than  In  connection  with  a  definitely  related 
government  procurement  operation,  the  U.  S. 

Government  thereby  incurs  no  responsibility,  nor  any 
obligation  whatsoever;  and  the  fact  that  the  Govern¬ 
ment  may  have  formulated,  furnished,  or  in  any  way 
supplied  the  said  drawings,  specifications,  or  other 
data  is  not  to  be  regarded  by  implication  or  other¬ 
wise  as  in  any  manner  licensing  the  holder  or  any 
other  person  or  corporation,  or  conveying  any  rights 
or  permission  to  manufacture,  use  or  sell  any 
patented  Invention  that  may  in  any  way  be  related 
thereto. 


00 

<0 


'  1  , 


>v 


ThU  is  a  working  paper.  it  may  bo  expanded, 
modified,  or  withdrawn  at  any  time.  The  views, 
conclusions,  and  recommendations  expressed  herein 
do  not  necessarily  reflect  the  official  views 
or  policies  of  the  United  States  Air  Force. 


m  aow 


nEitrs 

s  9m  0 


ik 

SANTA  MONICA  •  CAIIPORNIA 


U,  S.  AIR  FORCE 

PROJECT  RAND 

RESEARCH  MEMORANDUM 


OPTIMAL  TIMING  IN  MISSILE  LAUNCHING: 
A  GAME-THEORETIC  ANALYSIS 


Melvin  Dresher 
Selmer  Johnson 


RM— 2723 

March  1,  1961 

V _ ^ _ J 


Assigned  to _ 


Thii  research  it  sponsored  by  the  United  States  Air  Force  under  contract  No. 
AF  49(638)-700  monitored  by  the  Directorate  of  Development  Planning,  Deputy 
Chief  of  Staff,  Development,  Hq  USAF. 

This  is  a  working  paper  It  may  be  expanded,  modified,  or  withdrawn  at  any 
time.  The  views,  conclusions,  and  recommendations  expressed  herein  do  net 
necessarily  reflect  the  officiol  views  or  policies  of  the  United  States  Air  Force. 


1U 


mud 


1700  MAIN  ST.  •  SANTA  MONICA  »  CALIFORNIA 


RM— 2723 
ill 


SUMMARY 

Suppose  that  Blue  plans  to  launch  a  missile  during  some  specified 
time  interval.  Prior  to  a  launching,  Blue  must  expose  the  missile  for  a 
time  period,  during  which  the  missile  is  vulnerable  to  attack  and  may  be 
destroyed  by  Red.  Prior  to  its  exposure,  Blue's  missile  is  assumed  to 
be  underground,  where  it  may  be  pinned  down  for  a  time  interval  or 
possibly  destroyed  by  a  Red  missile.  What  is  the  optimal  time  for  Blue 
to  launch  his  missile,  and  when  should  Red  attack  Blue? 

This  scheduling  problem  is  formulated  and  analyzed  as  a  two-person 
game.  We  consider  various  game  models  in  which  the  above  three  typ?s 
of  vulnerability  are  introduced  as  stochastic  elements.  The  optimal 
strategies,  i.e.,  the  optimal  launching  times,  are  described. 


RM— 2723 

v 


CONTENTS 

SUMMARY  .  ill 

Section 

1.  INTRODUCTION .  1 

2.  THE  GAME  SITUATION .  1 

3.  DEFINITIONS  OF  PARAMETERS .  3 

4.  MODEL  I.  BLUE  VULNERABLE  PRIOR  TO 

AND  DURING  EXPOSURE  .  4 

5.  MODEL  II.  BLUE  VULNERABLE  PRIOR  TO 

AND  DURING  EXPOSURE  AND  PINDOWN .  7 

6.  MODEL  III.  EQUIVALENT  FINITE  GAME  .  9 

7.  MODEL  IV.  RED  HAS  R  RELIABLE  MISSILES  .  12 

8.  MODEL  V.  RED'S  MISSILES  ARE  NOT  RELIABLE  .  15 

9.  MODEL  VI.  FINITE  GAME  WITH  PINDOWN  (B  -  1) .  17 

REFERENCES . 23 


OPTIMAL  TIMING  IN  MISSILE  LAUNCHING: 
A  GAME-THEORETIC  ANALYSIS 


L  INTRODUCTION 

The  problem  of  scheduling  the  launching  of  a  single  missile  that 
must  first  be  exposed  for  a  finite  time  interval,  during  which  it  may  be 
attacked  by  an  enemy  missile,  has  been  formulated  and  solved  as  an 
infinite  game  in  [  1  ]  .  This  scheduling  problem  was  subsequently 
generalized  to  an  arbitrary  number  of  Blue  and  Red  missiles,  and  the 
generalized  problem  was  solved  in  [2]  by  analyzing  an  equivalent  finite 
game.  In  each  case  it  was  assumed  that  Blue  was  vulnerable  to  attack 
only  during  the  exposure  period. 

We  now  wish  to  study  the  scheduling  problem  on  the  assumption 
that  Blue  is  vulnerable  to  attack  before  exposure  —  i.e.  ,  while  Blue 
is  underground  —  as  well  as  during  exposure.  Additionally,  we  wish 
to  study  the  effect  of  pindown  on  Blue's  optimal  scheduling.  Finally,  we 
wish  to  analyze  the  effect  of  the  reliability  of  Red's  missiles  on  Blue's 
schedule. 

Accordingly,  in  this  paper  we  shall  first  describe  a  series  of  game 
models  to  study  the  above  factors.  We  shall  then  formulate  these  models 
mathematically,  and  shall  present  the  solution  to  some  of  them. 

2.  THE  GAME  SITUATION 

The  general  game  situation  that  we  wish  to  model  and  analyze  is 
the  following.  Blue  has  B  missiles  that  he  wishes  to  launch  before  T 
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minutes  have  expired,  and  Red  ha3  R  missiles  with  which  to  attack  Blue's 
missiles  during  these  T  minutes.  In  order  to  launch  a  missile,  Blue 
must  expose  it  for  L  minutes,  during  which  time  it  may  be  destroyed 
(killed  while  soft)  by  a  Red  missile.  Prior  to  exposure,  while  Blue's 
missile  is  underground,  the  missile  may  be  destroyed  (killed  while  hard), 
or  the  missile  launching  may  be  postponed  for  a  time  interval  (pinned  down) 
if  a  Red  missile  arrives  during  this  hardened  period.  Further,  a  given 
Red  missile  may  not  be  reliable  and  thus  may  fail  to  function. 

Now  in  this  game  situation  Blue's  object  is  simply  to  launch  his 
missiles.  Hence,  we  shall  make  the  payoff  to  Blue  the  number  of  missiles 
Blue  succeeds  in  launching. 

It  is  clear  that  a  strategy  for  Blue  is  a  choice  of  B  points  on  the 
time  interval  [0,  T]  at  which  to  begin  the  exposure  of  his  missiles. 
Similarly,  a  strategy  for  Red  is  a  choice  of  R  points  on  the  same  time 
interval  when  Red's  missiles  are  to  arrive  at  Blue’s  sites.  Depending 
on  the  amount  of  information  we  permit  the  two  players,  these  choices 
of  times  may  be  functions  of  the  number  of  missiles  Blue  and  Red  have. 

Since  a  strategy  is  a  choice  of  points  on  an  interval,  the  game  is 
an  infinite  game.  Generally,  however,  we  may  consider  an  equivalent 
finite  game  in  which  Red  chooses  R  intervals  and  Blue  chooses  B 
intervals.  In  most  cases  it  will  be  easier  to  solve  the  equivalent  finite 
game  than  the  original  infinite  game. 
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3.  DEFINITIONS  OF  PARAMETERS 

In  the  above  game  it  Is  assumed  that  both  sides  know  values  of  the 
following  parameters: 

T  «  number  of  minutes  Blue  and  Red  have  to  launch  their  missiles 
■  duration  of  game. 

L  *  number  of  minutes  a  Blue  missile  muBt  be  exposed  prior  to 
launching  ■  length  of  exposure  period. 

N  ■  [T/L]  ■  greatest  integer  less  than  or  equal  to  T/L 
*  number  of  exposure  periods  available  in  game. 

B  ■  number  of  missiles  Blue  has  at  start  of  game. 

R  ■  number  of  missiles  Red  has  at  start  of  game. 

r  *  probability  that  a  given  Red  missile  is  reliable. 

A  ■  probability  that  Blue's  missile  is  destroyed  by  a  Red  missile 
arriving  during  Blue's  missile  exposure  period. 

U  *=  probability  that  Blue's  missile  is  destroyed  by  a  Red  missile 
arriving  when  Blue's  missile  is  underground. 

F  *  probability  that  fallout  is  produced  by  a  Red  missile  arriving 
when  Blue's  missile  is  underground;  the  effect  of  fallout  is 
to  pin  down  Blue's  missile  underground  for  f  subsequent  periods. 

U1  ■  1  -(1  -U)(l  -  F)  »  probability  that  Blue's  missile  is  either 
destroyed  underground  or  pinned  down  by  a  Red  missile 
arriving  when  the  Blue  missile  is  underground. 
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a  ■  1  -rA  ■  probability  that  Blue's  missile  survives  If  exposed 
when  a  Red  missile  Is  scheduled  to  arrive. 

?  ■  l-rll  *  probability  that  Blue's  missile  survives  If  underground 
when  a  Red  missile  is  scheduled  to  arrive. 
y  m  1  —  rU1  ■  1  —  r[l  —  (1  —  U)(l  —  F)J  ■  probability  that  Blue's 
missile  survives,  and  Is  not  pinned  down,  if  underground 
when  a  Red  missile  is  scheduled  to  arrive. 

4.  MODEL  I.  BLUE  VULNERABLE  PRIOR  TO  AND  DURING  EXPOSURE 
In  [1]  and  [2]  it  was  assumed  that  Blue's  missiles  are  vulnerable 
to  attack  only  during  the  exposure  period.  We  shall  now  assume  that  the 
missiles  are  also  vulnerable  to  attack  durlng^the  period  prior  to  exposure, 
l.e. ,  when  the  missiles  are  underground  or  "hard.  "  Further,  we  shall 
treat  the  attack  outcomes  as  random  variables,  but  shall  omit  the  plndown 
effect  In  this  model. 

Suppose  Blue  and  Red  have  one  missile  each,  i.e.,  R  *  B  ■  1. 

Then  a  strategy  for  Blue  in  the  equivalent  finite  game  is  a  choice  of  a 
time  period  1,  where  1  <  i  <  N,  for  Blue  to  expose  his  missile  for  L 
minutes.  A  strategy  for  Red  is  a  choice  of  a  time  period  j,  where 
1  <  J  <  N,  during  which  Red's  missile  is  to  arrive  at  Blue's  launching 
site.  Since  the  payoff  is  +1  If  Blue  launches  his  missile  and  0  otherwise, 
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the  payoff  to  Blue  corresponding  to  each  pair  of  strategies  is  given  by 
the  matrix 


M 


<y 


a 

1 

1 

•  <  i 

1 

0 

• 

a 

• 

1 

• 

•  •  • 

1 

• 

0 

• 

• 

• 

•  •  • 

•  •  • 

.  .  a 

1 

\e 

0 

0 

.  •  0 

“1 

Each  row  of  this  matrix  represents  one  of  the  N  possible  exposure 
periods  for  Blue.  Each  column  represents  one  of  the  N  possible  arrival 
periods  for  Red. 

In  order  to  solve  this  game,  we  transform  the  payoff  matrix  M  to 
M1  by  setting 


I 


aij  ~  a 
1  -a  » 


and  defining 


P 


a  fi-a 
1  -  a 


Then  the  transformed  payoff  matrix  becomes 
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M'  =  (a*^) 


0 

P 

P 


P 
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1 

0 

P 


P 

P 


1 

1 

0 


p 

p 


\ 


/ 


By  solving  M  we  obca'n  the  following  results  for  the  original  game: 
(i)  The  optimal  strategy  for  Blue  is 

x.  =  L  '-Zfl  _L  for  1  <  i  <  N. 

1  N\  1  - 
(1  -  P  )  p 


(ii)  The  optimal  strategy  for  Red  is 


1  -  P 


1  N  y  • 

J  Pd-P1") 


for  1  <  j  <  N. 


(iii)  The  value  of  the  game  is  given  by 
N 


v.LuL  ,  i_ 

i-pn 


rU 


N  ‘ 


For  optimal  scheduling,  each  player  randomizes  over  each  of  the 
N  periods.  Note,  however,  that  the  probabilities  associated  with  each 
period  are  different  for  the  two  players.  As  can  be  seen  above,  Blue's 
probability  of  launching  increases  geometrically  with  time,  and  Red's 
probability  of  arrival  decreases  geometrically  with  time. 


5.  MODEL  II.  BLUE  VULNERABLE  PRIOR  TO 


RM— 2723 
7 


AND  DURING  EXPOSURE  AND  PIN  DOWN 

Let  us  now  assume  that  Blue's  missile  is  subject  to  pindown  as  well 
as  being  vulnerable  to  attack  prior  to  and  during  exposure.  Suppose  Blue 
and  Red  have  one  missile  each,  i.  e.,  R  ■  B  »  1.  Then  a  strategy  for 
Blue  is  a  choice  of  time  x  to  begin  exposing  his  missile,  where 
0  <  x  <  T  -  L.  A  strategy  for  Red  is  a  choice  of  a  time  y,  where 
0  <  y  <  T,  for  Red's  missile  to  arrive  at  Blue's  launching  site.  If  Blue 
is  attacked  when  his  missile  is  underground,  then  the  probability  that  Blue's 
missile  will  be  destroyed  is  U,  and  the  probability  that  his  missile  will  be 
pinned  down  for  f  subsequent  periods  is  F.  Let  us  assume  that  f  ■  2. 

From  the  above  description  of  the  game,  it  is  clear  that  each  player 
has  an  infinite  number  of  strategies.  The  payoff  to  Blue,  M(x,  y),  for 
each  pair  of  strategies,  (x,  y),  is  as  follows: 


1 

if 

x  +  L  <  y  <  T, 

a 

if 

x  <  y  <  x  +  L, 

P 

if 

x  >  y,  y  <  T  -  3L, 

y 

if 

x  >  y  >  T  -  3L. 

Although  this  is  an  infinite  game,  it  turns  out  that  only  a  finite 
number  of  strategies  will  be  involved  in  an  optimal  mixed  strategy  for 
each  player.  In  order  to  describe  the  optimal  mixed  strategies,  we  set, 
as  before, 
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.  £z_ 

l-o 


Now  let  us  define 


and 


- 

1  -  a  • 


p  -  \  +  X2  +  pX2, 


K  3  p 


n  N-3. 
(1  -  P  ) 

1  -  p 


It  can  be  verified  that  the  value  of  the  game  is  given  by 


1  - 


r  AU 


a/i  N-3.  A  TT  N-3/1 -X“ 
A(1  —  p  )  +  Up 


An  optimal  mixed  strategy  for  Blue  consists  of  choosing  one  of  the 
following  N  points  In  time  at  random: 

0,  L,  2L,  (N-4)L,  (N-3)L,  (N-2)L,  (N-l)L. 

These  points  have  the  following  prooabilities,  respectively: 

ji  N— 4  jr  N-5  ji  N-6  ^  X2  X^  J_ 

K  P  ’  K  P  ’  K  P  .  •  •  •  *  K»  K»  K  *  K  * 


An  optimal  mixed  strategy  for  Red  consists  of  choosing  one  of  the 
following  N  points  In  time  at  random: 


L,  2L,  3L,  (N-3)L,  (N-2)L,  (N-l)L,  NL. 
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These  N  points  have  the  following  probabilities,  respectively: 


-  * 
i  £  £_ 

C'  C'  C  1 


N— 4  N— 3  .  N-3  2  N-3 

£ _  £ _  b£ _  P 

>  c  *  c  *  c  '  c  * 


where 


,  N-3  ,  .3  „  _ 

1  —  p  ,  1  —  X  N— 3 

- K - - —  p 


1  -  p 


1  -X 


As  in  the  previous  model,  Blue's  probability  of  launching  increases 
with  time,  and  Red's  probability  of  attack  decreases  with  time. 


6.  MODEL  III.  EQUIVALENT  FINITE  GAME 

The  preceding  infinite  game  is  equivalent  to  the  following  finite 
game:  Blue  chooses  one  of  the  N  time  intervals  to  expose  his  missile 
for  L  minutes.  Red  chooses  one  of  the  N  time  intervals  during  which 
Red's  missile  is  to  arrive  at  Blue's  launching  site.  Thus  each  player 
has  N  strategies.  Let  the  pindown,  if  it  occurs,  be  for  f  periods. 
Then  the  NxN  payoff  matrix  becomes 


1  1  l\ 


M 


<v 


RM— 2723 
10 


In  order  to  solve  this  game,  it  is  convenient  to  make  the  following 


transformation: 


i  ai  i  ~  ° 


*ij  1  -  o  • 


The  payoff  matrix  for  the  transformed  game  now  becomes 


M*  *  (a|.) 


0  1  1 
p  0  1 

p  p  0 


P  P  P 
P  P  P 


1  1  1 
1  1  1 
1  1  1 
1  1  1 
0  1  1 


•  •  •  X  •  •  X  0  1 


•  •  X  •  •  X  X  0 


Let  Y  =  (y^  y2,  . . . ,  yN)  be  an  optimal  mixed  strategy  for  Red, 
and  let  v'  be  the  value  of  the  M*  game.  We  have  the  following  relations: 


p  E  y  +  E  y.  =  v 
j=l  J  j=i+l  J 

N— f— 1  i-1  N 

P  C  y.  +  X  E  y  +  Ey 
1  J  N-f  J  i+1  J 


if  i  <  N  -  f-  1, 


if  i  >  N  -  f . 


Solving  these  equations  for  y^,  we  get 


j"*"!  /  j 

p  (l-v) 


J  _v,( 


j  <  N-f,. 

J  >  N  -f  +  1, 
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where  v'  satisfies  the  following  relationship: 

N-f 


<l-v') 


JLzJl 


+  \p 


N-f-i  (1  -V) 
1 


1. 


Let  X  ■  (x^,  Xg,  . x^)  be  an  optimal  mixed  strategy  for  Blue. 
It  follows  that 


j-1  N 

E  x  +  p  E  x  -  v'  if  j  <  N-f-1, 

1  J+l 

j-1  N 

E  x  +  \  E  x  -  v  if  j  >  N  -  f. 

1  j+l 


From  the  above  equations,  we  get  Blue's  optimal  strategy: 


1 


i— 1 
P 


(1 


if  i  <  N  -  f, 


x,  = 


1 


.  i-N+f  N-f-1 


«-7> 


if  i  >  N  -f  +  1. 


In  order  to  determine  the  value  v  of  the  original  game,  we  make 
use  of  the  fact  that 

v  =  (1  —  a)v'  +  a  , 

and  of  the  relationship  (1)  for  v'.  We  get 


v  =  1  - 


1  -  a 


1  —  P 


N-f-1 


1  -  p 


+  P 


f+1 

N-f-i  (1  -  X1  A) 
1  -  \ 


Perhaps  the  most  outstanding  feature  of  the  optimal  strategies  is 
that  Blue  and  Red  randomize  over  their  respective  strategies  in  opposite 
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ways.  Blue's  probability  of  missile  exposure  increases  with  time;  i.e., 
he  tends  to  postpone  action.  Red's  probability  of  missile  firing  decreases 
with  time;  i.e.,  he  tends  to  fire  early. 

7.  MODEL  IV.  RED  HAS  R  RELIABLE  MISSILES 

Suppose  Blue  has  one  missile  and  Red  has  several  missiles,  but 
each  missile  is  reliable.  Since  we  have  assumed  that  Blue  knows  the 
number  of  missiles  that  Red  has  at  the  beginning  of  the  game,  it  is 
reasonable  to  assume  that  Blue  knows  at  any  given  time  the  number  of 
missiles  Red  has.  Further,  Red  may  play  at  each  instant  as  if  Blue  has 
one  missile.  This  yields  a  recursive  game. 

Let  v(m,  n)  define  the  value  of  the  game  in  which  Red  has  n  missiles 
at  the  beginning  of  the  in— period  game.  Then  at  the  initial  period  of  this 
game  Red  has  n+1  choices:  n,  n  —  1,  ...,  2,  1,  0  missiles  to  arrive  in 
that  period.  Blue  has  two  choices:  0  or  1  missile  to  expose  in  that  period. 
The  payoff  is  given  by  the  following  2x(n  +  l)  matrix: 


Blue 

Red  Choice 

Choice 

n 

n  -  1 

...  1 

0 

0 

n 

a 

n— 1 
a 

•  •  •  a 

1 

1 

Pn 

j3n  *v(m  — 1, 

1)  •  •  •  j3v(m  —  1,  n—  1) 

v(m-rl,  n) 

We  have  the  initial  condition  v(m,  0)  »  1.  Using  the  above  matrix, 
we  can  compute  values  of  v(m,  n)  recursively  for  all  m  <  N  and  n  <  R. 
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We  can  also  make  use  of  the  result  from  the  previous  section  that 


v(m,  1)  «  1  - 


1-0 


,a  m 

-(f=S) 


1  “  P 


n 


n 


It  is  obvious  that 

v(l,  n)  »  o11. 

For  example,  in  order  to  compute  v(2,  2),  we  use  the  payoff 
matrix 

a2  a  1 

02  a/3  a2 


For  the  special  case  in  which  A  =  1,  i.e. ,  an  exposed  missile 
will  be  destroyed  when  a  Red  missile  arrives,  we  can  obtain  an  explicit 
solution.  First,  it  is  evident  that  Red  should  never  salvo.  Hence  a 
strategy  for  Red  is  the  choice  of  zero  or  one  missile  during  each  period. 
This  yields  a  series  of  2  x  2  games  that  earn  be  solved  recursively. 

We  can  also  solve  this  special  case,  A  =  1,  directly.  A  strategy 
for  Blue  is  a  choice  of  a  time  period  to  expose  this  missile.  A  strategy 
for  Red  is  a  choice  of  R  of  the  N  time  periods  for  his  R  missiles  to 
arrive.  For  example,  if  R  =  2  and  N  =  4,  we  have  the  following 
strategies  and  payoffs: 
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Blue 

Strategy 

Red  Strategy 

43 

42 

41 

32 

31 

21 

4 

0 

0 

0 

1 

1 

1 

3 

0 

0 

0 

0 

0 

1 

2 

p2 

0 

0 

0 

0 

0 

1 

e2 

e2 

0 

<i2 

0 

0 

An  optimal  strategy  for  Blue  is  to  expose  his  missile  at  period 
4,  3,  2,  and  1  with  relative  frequencies 

p3,  p3,  p,  i, 


respectively. 

An  optimal  strategy  for  Red  is  to  choose  one  of  his  six  strategies 
with  relative  frequencies 

1,  0,  02,  02,  03,  04. 


In  particular,  if  we  define  y^  to  be  the  probability  that  Red's  first 
missile  arrives  during  period  4,  then 


The  value  of  the  game  is  given  by 


02-0 
-04 


v  B 
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By  the  above  procedure,  the  value  is  readily  shown  for  arbitrary 

R  and  N,  where  R  <  N,  to  be 

qR  ftN  flR  N 
..  ,  P-  ~  P  -  =  P  ~  P 


1-0 


N 


1  -  P 


N 


The  optimal  relative  frequencies  for  Blue  are 

aN— 1  ftN-2  .  . 

P  f  P  f  •  •  •  f  Hp  1 • 


An  optimsd  strategy  for  Red,  if  expressed  explicitly,  is  very  complicated. 
At  any  stage  (R,  N),  however,  the  optimal  strategy  has  the  property 
that 

,R 


1—0 


N 


1-0 


N  ’ 


As  0  —  1,  we  obtain  the  uniform  distribution  given  in  [2] 


8.  MODEL  V.  RED'S  MISSILES  ARE  NOT  RELIABLE 

We  shall  now  generalize  the  preceding  model  by  assuming  that 
r  <  1,  i.e.,  that  Red's  missiles  may  not  be  reliable,  and  A  <  1.  This 
changes  the  complete— information  game  to  a  game  with  incomplete 
information.  Nevertheless,  some  of  the  results  of  the  preceding  game 
carry  over.  We  shall  describe  some  partial  results  regarding  this  type 
of  game. 

Suppose  that  R  *  2  and  N  >  3.  Let  Bq  be  the  Blue  mixed  strategy 

N— 1  N— 2  2 

defined  by  the  relative  frequencies  p  ,  p  ,  ...,  p  ,  p,  1.  It  can 
be  shown  that  if  Blue  plays  B^,  then  Red  should  not  salvo.  Thus,  if 
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N  ■  4  and  if  we  omit  the  Red  salvo  strategies,  we  have  the  following 
payoff  matrix: 


43 

42 

41 

32 

31 

31 

4 

1° 

a 

a 

1 

1 

>  \ 

3 

<*P 

P 

P 

a 

a 

1 

2 

P2 

aP 

P 

ap 

P 

a 

1 

w 

fi2 

aP 

o2 

ap 

ap  y 

Each  of  these  six  Red  strategies  yields  the  same  expected  payoff 
against  Blue's  BQ  strategy.  Further,  if 


a  < 


P 


R 


N 

N 
-  P 


> 


where  R  is  arbitrary,  it  is  possible  to  find  a  mixture  of  nonsalvo  Red 
strategies  that  yields  this  same  expected  payoff  against  each  of  Blue's 
pure  strategies.  Moreover,  the  nonsalvo  strategies  dominate  the  salvo 
strategies  if  Blue  uses  his  optimal  strategy  B^. 

Thus,  for  arbitrary  R  and  N,  the  value  of  the  game  is 


v 


Hence  Blue's  optimal  strategy  is  independent  of  Red's  strength. 

In  this  game  Blue  would  get  information  at  the  end  of  each  period 
only  if  a  reliable  Red  missile  had  been  fired  during  that  period.  Blue 
would  therefore  know  which  subgame  he  is  playing.  If  Red's  missile 
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had  been  unreliable,  however,  then  Blue  would  not  know  Red's  resources 
for  the  remaining  periods. 

Suppose  Blue  has  B  missiles,  B  >  1,  which  may  be  launched 
only  one  per  time  period.  Because  of  the  information  about  an  opponent's 
resources,  Blue  and  Red  have  a  very  large  number  of  strategies.  If, 
however, 


a  < 


N 

H£_ 
N 
■  P 


and 


B  < 


t 


then  we  can  show  that,  for  arbitrary  N  and  R,  the  optimal  scheduling 
for  each  side  is  independent  of  the  opponent's  resources. 


9.  MODEL  VI.  FINITE  GAME  WITH  PINDOWN  (B  *  1) 

In  the  case  of  the  finite  game  with  pindown  a  possibility,  both  sides 
have  incomplete  information,  but  we  shall  assume  that  neither  side  can 
act  on  the  information  during  the  play  of  the  game.  Each  side  formulates 
a  firing  schedule  in  advance  and  will  not  change  it  during  the  game,  except 
to  postpone  exposure  when  forced  to  do  so  by  pindown. 

Such  a  game  has  a  large  number  of  strategies.  For  example,  if 
N  =  4,  R  =  2,  we  have  the  following  payoff  matrix: 
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Blue 

Red  Strategy 

Strategy 

44 

43 

42 

41 

32 

31 

22 

21 

11 

4 

2 

a 

a 

a 

a 

1 

1 

1 

1 

1 

1 

3 

32 

ya+{&-y)y 

y>  \HP-y)y 

y  l+(P-y)a 

2 

a 

□ 

B 

1 

1 

1 

2 

32 

fiy 

ay  +(P-y)y 

v  l  HP-y)<* 

2 

y 

1 

B 

2 

a 

a 

1 

1 

S2 

Py 

Py 

Pa 

2 

Y 

| 

2 

y 

ya 

2 

a 

We  note  from  the  payoff  matrix  that  if  Blue  chooses  his  3  strategy 
and  Red  chooses  his  43  strategy,  then  the  payoff  is  ya  +  (P  —  y)y .  This 
is  obtained  as  follows:  Either  there  is  pindown  or  there  is  none,  due  to 
Red's  first  missile  arriving  in  the  4th  period.  If  there  is  no  pindown, 
Blue's  missile  survives  while  underground  with  probability  y.  Then  he 
will  proceed  as  scheduled  to  launch  his  missile  during  the  3rd  period. 
With  probability  a  he  will  survive  Red's  second  missile  scheduled  to 
arrive  in  that  period. 

On  the  other  hand,  if  there  is  pindown  from  Red's  first  missile, 
then  Blue  must  postpone  his  launching  to  period  one,  the  last  period.  In 
that  case,  the  probability  that  he  will  survive  Red's  second  missile  is  y. 

In  this  way,  each  Red  missile  arriving  before  the  (f+l)th  period 
from  the  end  gives  rise  to  2  cases  (pindown  or  no  pindown).  The  entries 
in  the  payoff  matrix  for  R  missiles  can  be  expressed  as  weighted  sums 
of  two  entries  taken  from  game  matrices  for  smaller  numbers  of  Red 


missiles. 
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While  these  large  games  look  hopelessly  complicated  at  first  glance, 
there  is  hope  of  solution  for  certain  ranges  of  the  probabilities  when  R  is 
small.  Approximations  to  this  model  can  be  worked  out  sequentially. 

As  an  approximation  to  this  model,  we  can  set  up  the  corresponding 
sequential  game  in  which  both  sides  have  complete  information  at  each 
stage.  At  each  stage  or  period,  Red  will  have  R1  +  1  choices,  and  Blue 
will  have  two  choices  at  most.  Recursive  relations  can  be  built  up  from 
matrices  similar  to  the  following  game  matrix  illustrating  the  case 
R  *  3,  N  *  4: 


Blue 

Red  Strategy 

Strategy 

(4,  3) 

(4,  2) 

(4,  1) 

(4,  0) 

(4,  1) 

a3 

a2 

a 

1 

(4,  0) 

y3V<3.  0)  + 
(0-y)3V<l,  0) 

y2V(3,  l)  + 
(0-y)2V(l,  l) 

yV(  3,  2)  + 

(/3-y)V(l,  2) 

V(3,  3) 

where  V(n',  r')  =  the  value  of  the  subgame  with  N'  periods  and  R*  Red 
missiles  left. 

Given  the  parameters,  we  can  solve  the  games  by  using  these 
recursive  relations.  Figures  1  and  2  present  the  game  values  for  two 
different  sets  of  parameters. 
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Fig.  I  — Gome  values  for  parameters 
r  *  0.7,  U«  0.15,  F s  0.7,  A«l 
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